Words containing ,:

Rank in Wordlist | Frequency | Word |
---|---|---|
6152 | 796 | 1,5 |
8057 | 578 | 2,5 |
10420 | 415 | 1,2 |
11006 | 388 | 3,5 |
12180 | 341 | 1,3 |
12454 | 332 | 4,5 |
13561 | 298 | 1,7 |
14274 | 279 | 1,6 |
14578 | 271 | 1,8 |
14618 | 270 | 1,4 |

Words containing ):

Rank in Wordlist | Frequency | Word |
---|---|---|
150203 | 6 | .) |

Words containing %:

Rank in Wordlist | Frequency | Word |
---|---|---|
3380 | 1554 | 100% |
3869 | 1333 | 10% |
3998 | 1292 | 20% |
4135 | 1247 | 50% |
4174 | 1234 | 30% |
4644 | 1100 | 40% |
4739 | 1080 | 80% |
5641 | 883 | 25% |
5678 | 875 | 60% |
5959 | 828 | 70% |

Words containing &:

Rank in Wordlist | Frequency | Word |
---|---|---|
20391 | 169 | CD&V |
25794 | 119 | R&D |
29418 | 97 | S&P |
31284 | 88 | H&M |
67601 | 25 | AT&T |
77437 | 20 | Gault&Millau |
77893 | 20 | R&B |
101930 | 13 | natur&ëmwelt |
105129 | 12 | S&S |
116757 | 10 | P&R |

Words containing $:

Rank in Wordlist | Frequency | Word |
---|---|---|
15806 | 242 | $ US |
79168 | 19 | $ CAN |
90519 | 15 | $ CA |
186743 | 4 | 15$/heure |
216613 | 3 | 15$/h |
234702 | 3 | M$US |
268113 | 2 | A$AP |
355602 | 1 | 000$,dont |
355603 | 1 | 000$. |
355604 | 1 | 000$/an |

Words containing ":

Rank in Wordlist | Frequency | Word |
---|---|---|
244 | 18719 | ." |

Words containing ':

Rank in Wordlist | Frequency | Word |
---|---|---|
141 | 29648 | d'un |
152 | 28112 | d'une |
193 | 23096 | c'est |
200 | 22181 | qu'il |
217 | 20645 | s'est |
234 | 19260 | C'est |
316 | 15069 | n'a |
333 | 14625 | n'est |
886 | 5887 | qu'elle |
892 | 5846 | d'être |

Words containing +:

Rank in Wordlist | Frequency | Word |
---|---|---|
34942 | 74 | 90e+1 |
34943 | 74 | 90e+3 |
38400 | 64 | 90e+2 |
41723 | 56 | 90e+4 |
44721 | 50 | 45e+1 |
54729 | 36 | GMT+1 |
60718 | 30 | 45e+2 |
64593 | 27 | 90e+5 |
67598 | 25 | 90+1 |
77015 | 20 | 90+3 |

Words containing *:

Rank in Wordlist | Frequency | Word |
---|---|---|
314369 | 2 | Sagittaire A* |

Words containing /:

Rank in Wordlist | Frequency | Word |
---|---|---|
2661 | 2009 | km/h |
2932 | 1805 | https://t |
7241 | 659 | et/ou |
8301 | 557 | P/APC |
10893 | 393 | d’Ivoire/ |
17009 | 219 | 2018/2019 |
17133 | 217 | P/APW |
21470 | 156 | 2017/2018 |
22285 | 148 | Saucourt/L'Est |
23768 | 135 | https://www |
In the last subsection of this type we look for words containing other special characters: , ( ) % & $ " ' + * = / _
Depending on the language, some of these characters may be allowed within words while others are not. If words containing forbidden characters do not have a very low frequency, there may be a problem in preprocessing.
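The check described here can be sketched in a few lines of Python. This is only an illustration, not the procedure used to produce the tables: the function name `flag_suspicious`, the choice of allowed characters, and the `min_freq` cut-off are all assumptions made for the example.

```python
# The special characters examined in this subsection.
SPECIAL_CHARS = set(",()%&$\"'+*=/_")


def flag_suspicious(rows, allowed, min_freq=10):
    """Return rows whose word contains a forbidden special character
    and whose frequency is at least min_freq.

    rows     -- iterable of (rank, freq, word) tuples
    allowed  -- special characters considered legitimate inside words
    min_freq -- hypothetical cut-off for "not very low" frequency
    """
    forbidden = SPECIAL_CHARS - set(allowed)
    flagged = []
    for rank, freq, word in rows:
        hits = sorted(forbidden.intersection(word))
        if hits and freq >= min_freq:
            flagged.append((rank, freq, word, "".join(hits)))
    return flagged


# Sample rows copied from the tables in this section.
rows = [
    (141, 29648, "d'un"),
    (244, 18719, '."'),
    (2932, 1805, "https://t"),
    (3380, 1554, "100%"),
    (355603, 1, "000$."),
]

# Assumption: apostrophe and percent sign are acceptable inside words.
for rank, freq, word, chars in flag_suspicious(rows, allowed="'%"):
    print(rank, freq, word, chars)
```

With these assumed settings, high-frequency entries containing a quotation mark or a slash are reported, while a frequency-1 entry such as `000$.` is ignored as noise.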
Example query for words containing %:
select w_id-100,freq, word from words where w_id>100 and word like "%\%%" limit 10;
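The same query can be repeated for each of the other characters. The sketch below uses Python's built-in `sqlite3` module with an in-memory table as a stand-in for the real database. The column names (`w_id`, `freq`, `word`) come from the query above; the sample rows, the helper name `words_containing`, the `ORDER BY`, and the explicit `ESCAPE` clause are assumptions. The query above appears to rely on backslash being the default `LIKE` escape character (as in MySQL), whereas SQLite requires the escape character to be declared.

```python
import sqlite3


def words_containing(conn, char, limit=10):
    """Return (rank, freq, word) for words containing `char`.

    Mirrors the query above: only rows with w_id > 100 are considered
    and the rank is reported as w_id - 100.
    """
    # Escape the LIKE wildcards (% and _) and the escape character itself.
    if char in ("%", "_", "\\"):
        char = "\\" + char
    pattern = "%" + char + "%"
    sql = (
        "SELECT w_id - 100, freq, word FROM words "
        "WHERE w_id > 100 AND word LIKE ? ESCAPE '\\' "
        "ORDER BY w_id LIMIT ?"  # ORDER BY added here for deterministic output
    )
    return conn.execute(sql, (pattern, limit)).fetchall()


# In-memory stand-in for the corpus database (hypothetical sample data).
conn = sqlite3.connect(":memory:")
conn.execute("CREATE TABLE words (w_id INTEGER PRIMARY KEY, freq INTEGER, word TEXT)")
conn.executemany(
    "INSERT INTO words VALUES (?, ?, ?)",
    [
        (37, 500000, "%"),      # hypothetical row with w_id <= 100, excluded
        (3480, 1554, "100%"),   # rank 3380 in the table above
        (3969, 1333, "10%"),    # rank 3869
        (2761, 2009, "km/h"),   # rank 2661
        (20491, 169, "CD&V"),   # rank 20391
    ],
)

# Run the query once per special character and show non-empty results.
for char in ",()%&$\"'+*=/_":
    rows = words_containing(conn, char)
    if rows:
        print(char, rows)
```

Escaping matters most for `%` and `_`: without it, the pattern for `_` would match every word of at least one character rather than only words containing an underscore.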
3.12.1 Words with Hyphens
3.12.2 Multiwords
3.12.3 (Multi-)Words with dots